首页> 外文OA文献 >Efficient HTTP based I/O on very large datasets for high performance computing with the libdavix library
【2h】

Efficient HTTP based I/O on very large datasets for high performance computing with the libdavix library

机译:在非常大的数据集上实现高效的基于HTTp的I / O,以实现高性能   使用libdavix库进行计算

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Remote data access for data analysis in high performance computing iscommonly done with specialized data access protocols and storage systems. Theseprotocols are highly optimized for high throughput on very large datasets,multi-streams, high availability, low latency and efficient parallel I/O. Thepurpose of this paper is to describe how we have adapted a generic protocol,the Hyper Text Transport Protocol (HTTP) to make it a competitive alternativefor high performance I/O and data analysis applications in a global computinggrid: the Worldwide LHC Computing Grid. In this work, we first analyze thedesign differences between the HTTP protocol and the most common highperformance I/O protocols, pointing out the main performance weaknesses ofHTTP. Then, we describe in detail how we solved these issues. Our solutionshave been implemented in a toolkit called davix, available through severalrecent Linux distributions. Finally, we describe the results of our benchmarkswhere we compare the performance of davix against a HPC specific protocol for adata analysis use case.
机译:高性能计算中用于数据分析的远程数据访问通常使用专门的数据访问协议和存储系统来完成。这些协议已针对大型数据集,多流,高可用性,低延迟和高效的并行I / O的高吞吐量进行了高度优化。本文的目的是描述我们如何适应通用协议,超文本传输​​协议(HTTP),使其成为全球计算网格(全球LHC计算网格)中高性能I / O和数据分析应用程序的有竞争力的替代方案。在这项工作中,我们首先分析HTTP协议与最常见的高性能I / O协议之间的设计差异,指出HTTP的主要性能弱点。然后,我们详细描述如何解决这些问题。我们的解决方案已在称为davix的工具包中实现,可通过多个最新的Linux发行版获得。最后,我们描述了基准测试的结果,其中我们将davix的性能与针对数据分析用例的HPC特定协议进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号